SYSTEM AND METHOD FOR SWITCHING BETWEEN COMMAND- 
MODE AND TEXT-MODE IN VOICE-RECOGNITION SYSTEMS 
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PRIORITY NOTICE 



This Non-Provisional U.S. Patent Application claims the benefit of the July 
14, 2000 filing date of Provisional U.S. Patent Application Serial Number 
60/218,350. 



The present invention relates to voice recognition systems, and more 
particularly to a system and method for enabling such systems to differentiate 
between spoken system commands and spoken text for literal insertion into a 
software application. 



Voice-recognition systems are gradually becoming an effective 
replacement for keyboards used in computing systems. In a typical voice 
recognition system, the user installs a voice recognition software, then trains the 

20 system to recognize his or her voice. Thereafter, instead of using a keyboard to 
input commands and/ or text to the system, the user speaks into a microphone 
that is typically worn either as a headset or extends from a pedestal on the user's 
desk. Current examples of voice-recognition software include IBM's " ViaVoice," 
Lernout & Hauspie's "VoiceXpress," and Dragon System's "Dragon 

25 NaturallySpeaking." 



FIELD OF THE INVENTION 
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BACKGROUND OF THE INVENTION 
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A significant problem encountered in conventional voice recognition 
systems is the "text-mode/ command-mode" problem in which such systems 
have difficulty knowing whether to interpret the user's words as system 
commands or insertable text. For example, if while using Microsoft Word the 

5 user says "open Web browser/ 7 voice recognition systems have difficulty 
knowing whether the user wants to open a Web browser as a separate 
application, or insert text into a Microsoft Word document. Conventional voice 
recognition systems' solution to this problem is the "command phrase 
dictionary" by which the system listens for certain exact command phrases 

10 which are interpreted as commands rather than text. However, the "command 
phrase dictionary" approach has several drawbacks. One significant drawback is 
that it makes the voice recognition system much harder to learn and use. Instead 
of letting the user use natural language to command the voice recognition 
system, it requires the user to memorize many command phrases, some of them 

15 unnatural, before he or she can use the system effectively. The reason "command 
phrase dictionaries" must often use unnatural language is that the most obvious 
phrases may be commonly used in dictated text. For example, some systems 
require a user to say "scratch that" rather than "delete" in order to delete a 
phrase, and "microphone off", rather than "turn the microphone off" or "mike 

20 off", in order to turn off the microphone. Incorrect command phrases will often 
be interpreted as text for insertion. 
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Another drawback of the "command phrase dictionary" approach is that, 
because there is no standard command phrase dictionary among competing 
voice-recognition system providers, a user must learn multiple command phrase 
dictionaries if he/she is using different voice recognition systems. Yet another 

5 drawback is that the "command phrase dictionary" approach makes the voice 
recognition system slower. When the voice recognition system hears any word 
that might be the start of a command phrase, the system must wait to hear the 
entire phrase before it knows whether or not the user's words are intended as 
text-mode or command-mode. 

10 It is therefore an object of the present invention to provide a system and 

method for use in a voice recognition system that will enable the system to know 
before a user speaks whether the spoken words are to be interpreted as system 
commands or insertable text. It is also an object of the present invention to 
provide a system and method that will enable users to issue commands using 

15 natural language rather than having to memorize specific command phrases, 
thereby eliminating the problem of non-standardized "command phrase 
dictionaries" posed by competing voice recognition systems. Thus, it is an object 
of the present invention to make voice-recognition systems easier and faster to 
use than conventional voice-recognition systems. 

20 
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SUMMARY 



A system and method for switching between the command-mode of 
operation and the text-mode of operation in voice-recognition systems. The 
present invention is implemented using software and a mechanical switch, 

5 wherein the switch can be implemented in a computer mouse, a microphone, or a 
remote device that combines the functionality of both a computer mouse and a 
microphone. When depressed in a first manner, this switch engages the voice- 
recognition system in the text-mode, and when depressed in a second manner it 
engages the voice-recognition system in the command-mode. Alternatively, the 

10 present invention can be implemented using software alone without a 
mechanical switch, i.e., a software switch, wherein spoken commands switch the 
voice-recognition system between the command-mode and the text-mode. 

BRIEF DESCRIPTION OF THE DRAWINGS 
15 FIG. 1 shows a first exemplary embodiment of an apparatus for switching 

between the command-mode of operation and the text-mode of operation in a 
voice-recognition system. 

FIG. 2 shows a second exemplary embodiment of an apparatus for 
switching between the command-mode of operation and the text-mode of 
20 operation in a voice-recognition system. 




FIG. 3 shows a third exemplary embodiment of an apparatus for 
switching between the command-mode of operation and the text-mode of 
operation in a voice-recognition system. 

FIG. 4 shows a flowchart depicting the operation of the present invention. 

5 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention is a system and method for switching between the 
command-mode of operation and the text-mode of operation in voice- 
recognition systems. The system of the present invention is comprised of voice- 

10 recognition software and an apparatus for selectively switching between the 
command-mode and the text-mode. FIG. 1 shows a first exemplary embodiment 
of an apparatus 10 for switching between the command-mode and the text-mode 
in voice-recognition systems according to the present invention. Apparatus 10 is 
implemented as a computer mouse which can be fabricated in any shape, have 

15 any number of mouse buttons, and have a page up/ down scroll wheel between 
two of the buttons. Apparatus 10 is used with a microphone which is not shown 
and into which a user speaks. This microphone may or may not have a 
command-mode/ text-mode switch to operate a computer, and can be any type 
of microphone such as a handheld, desk-mounted or headset microphone, or a 

20 remote combination microphone-mouse/ cursor control device. Alternatively, the 
microphone may be implemented on apparatus 10. Apparatus 10 is coupled to a 




computer by a cable. Alternatively, apparatus 10 can be a wireless device that 
operates a computer at any frequencies permitting wireless communications. 

Apparatus 10 includes a mechanical command-mode/ text-mode switch 12 
for switching between the command-mode and the text-mode of operation. 
5 Switch 12 is a spring-controlled slide-switch that can be slidably moved and is 
positioned on the left side of the mouse where it can easily be controlled by a 
user's thumb. Switch 12 is a three-mode switch having three operating positions. 
The first position is the default position, wherein switch 12 is centered between 
the command-mode position and text-mode position. When switch 12 is in the 

10 default position the microphone is turned off. If the user uses his thumb to push 
switch 12 forward towards the mouse-buttons end of the mouse, the microphone 
is turned on in the command-mode. Thereafter, when the user releases switch 12, 
it automatically slides back to the default center position and the microphone is 
turned off. If the user uses his thumb to pull switch 12 backwards towards the 

15 rear end of the mouse, the microphone is turned on in the text-mode. Thereafter, 
when the user releases switch 12 it automatically slides back to the default center 
position and the microphone is turned off. Alternatively, the operating positions 
of switch 12 can be altered from those described above. 

In an alternative embodiment of apparatus 10, command-mode/ text- 

20 mode switch 12 can be implemented as a two-mode switch having two operating 
positions, i.e., command-mode and text-mode, with a separate control element 
being used to turn the microphone on and off. In yet another alternative 
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embodiment of apparatus 10, command-mode/ text-mode switch 12 can be 
implemented using at least one button and/ or at least one touch-sensitive touch 
pad, which button and/ or touch pad can be positioned anywhere on the mouse. 
FIG. 2 shows a second exemplary embodiment of an apparatus 20 for 
5 switching between the command-mode of operation and the text-mode of 
operation in voice-recognition systems according to the present invention. 
Apparatus 20 is implemented as a microphone which can be shaped like any 
hand-held or desk-mounted microphone and can have any number of buttons. 
Apparatus 20 is used with a mouse which is not shown and which may not have 

10 a command-mode/ text-mode switch to operate a computer. Apparatus 20 is 
coupled to a computer by a cable. Alternatively, apparatus 20 can be a wireless 
device that operates a computer at any frequencies permitting wireless 
communications. Apparatus 20 includes a microphone 22 into which a user 
speaks and a command-mode/ text-mode switch 24 that works in a similar 

15 manner to command-mode/ text-mode switch 12 described above. 

FIG. 3 shows a third exemplary embodiment of an apparatus 30 for 
switching between the command-mode of operation and the text-mode of 
operation in voice-recognition systems according to the present invention. 
Apparatus 30 is implemented as a hand-held personal digital assistant/ organizer 

20 such as a Palm Pilot. Apparatus 30 can be shaped like any hand-held organizer 
and can have any number of buttons and/ or touch sensitive touch pads. 
Apparatus 30 is coupled to a computer by a cable. Alternatively, apparatus 30 
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can be a wireless device that that operates a computer at any frequencies 
permitting wireless communications. 

Apparatus 30 includes a microphone 32 into which a user speaks, and a 
command-mode button 34 and a text-mode button 36, which respective buttons a 

5 user presses and holds depending on whether they want to issue commands to a 
computer or insert text into a software application. Apparatus 30 can also include 
buttons and/ or a touchpad for controlling cursor movement and/ or replicating 
the functions of a computer mouse button, that works in a similar manner to 
command-mode/ text-mode switch 12 described above. 

10 Apparatus 10, 20 and 30 are implemented using conventional hardware 

and/ or software which enable such devices to communicate with computers and 
other electronic appliances/ devices. Apparatus 10, 20 and 30 are designed to 
operate computers using existing software operating systems, such as Microsoft 
Windows, and existing software applications, such as Microsoft Word or a web 

15 browser. Apparatus 10, 20 and 30 can communicate with computers over any 
frequencies that permit wireless, remote communications, including infrared and 
radio frequencies. Alternatively, apparatus 10, 20 and 30 can be coupled to the 
computer using a cable. Apparatus 10, 20 and 30 can each be used to sequentially 
operate multiple computers and/ or appliances/ devices. Apparatus 10, 20 and 30 

20 can be used to operate computers and appliances/ devices as long as the 
appropriate drivers and communication hardware and protocols are installed on 
such computers and appliances/ devices. 
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Apparatus 10, 20 and 30 can be used with headgear such as goggles, 
monocles and glasses that are worn on the head to operate a computer. In 
addition, multiple persons each wearing a separate apparatus 10, 20 and 30 can 
simultaneously play a game or jointly operate an application on the computer. 



without a mechanical switch, i.e., a software switch, wherein commands spoken 
by the user switch the voice-recognition system between the command-mode 
and the text-mode. For example, the software can be written to enable a user to 
affect such switching by speaking the words "command-mode" or "text-mode." 

10 Alternatively, other command words can be used to switch between modes. The 
software switch can be implemented in apparatus 10, 20, and/ or 30 in addition 
to or in lieu of mechanical switches 12, 24, and 32 and 34, respectively. 

The present invention can prompt a user to clarify a command when the 
system is unable to understand a spoken command. For example, if while in the 

15 command-mode the voice-recognition system does not understand a command 
spoken by the user, the system can prompt the user to clarify the command by 
selecting the intended command from one of a plurality of commands displayed 
on the computer display device that the system believes the user intended to 
issue. The user can use apparatus 10, 20 or 30 to select the intended command by 

20 issuing a spoken command. Alternatively, the user can select the intended 
command by pressing a button or touching a touch pad on apparatus 10, 20 or 
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The present^ invention can also be implemented using software alone 



30. 
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FIG. 4 shows a flowchart depicting the operation of the present invention, 
wherein the system is implemented using apparatus 10. At step 1, a user slides 
switch 12 to the command-mode position to engage the voice-recognition system 
in the command-mode of operation thereby also turning on the microphone (not 
5 shown). At step 2, while holding switch 12 at the command-mode position, the 
user speaks into the microphone and commands the computer to open a 
software application. At step 3, the user slides switch 12 from the command- 
mode position to the text-mode position to engage the voice-recognition system 
in the text-mode of operation. The Microphone is only turned on when switch 12 

10 is at either the command-mode or text-mode positions. The microphone is turned 
off when switch 12 is moved between the command-mode and text-mode 
positions. At step 4, while holding switch 12 at the text-mode position, the user 
speaks the text to be inserted into the software application. At step 5, the user 
slides switch 12 from the text-mode position back to the command-mode 

15 position. At step 6, while holding the switch at the command-mode position, the 
user commands the computer to save the file and close the application. At step 7, 
the user releases switch 12 which returns to its default position at which the 
microphone is also turned off. The steps for using the present invention with 
either apparatus 20 or 30 are similar to those just described. 

20 The aforementioned components comprising apparatus 10, 20 and 30 can 

be fabricated from any material, and apparatus 10, 20 and 30 can be fabricated in 
any shape and any size. Numerous modifications to and alternative 



embodiments of the present invention will be apparent to those skilled to the art 
in view of the foregoing description. Accordingly, this description is to be 
construed as illustrative only and is for the purpose of teaching those skilled in 
the art the best mode of carrying out the invention. Details of the structure may 
be varied substantially without departing from the spirit of the invention and the 
exclusive use of all modifications which come within the scope of the appended 
claims is reserved. 
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